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COUNTING REAL CRITICAL POINTS OF THE DISTANCE 
TO ORTHOGONALLY INVARIANT MATRIX SETS 

DMITRIY DRUSVYATSKIY, HON-LEUNG LEE, AND REKHA R. THOMAS 


Abstract. Minimizing the Euclidean distance to a set arises frequently in 
applications. When the set is algebraic, a measure of complexity of this 
optimization problem is its number of critical points. In this paper we provide 
a general framework to compute and count the real smooth critical points of a 
data matrix on an orthogonally invariant set of matrices. The technique relies on 
“transfer principles” that allow calculations to be done in the space of singular 
values of the matrices in the orthogonally invariant set. The calculations often 
simplify greatly and yield transparent formulas. We illustrate the method on 
several examples, and compare our results to the recently introduced notion of 
Euclidean distance degree of an algebraic variety. 


1. Introduction 


Finding an element of a subset V in K" closest to a specified point y is a 
common task in computational mathematics, often called the Euclidean distance 
(ED) minimization problem: 


( 1 . 1 ) 


minimize subject to x GV. 


Our current work is motivated by the systematic study of the “critical points” of the 
problem in an algebraic setting, initiated in [12] and continued in 


11 


19,26 


There, the basic assumption is that V is a real variety — zero set of finitely many 
polynomials with real coefficients. Consider now the set Vc of complex points 
satisfying the defining equations of V. Then a eritical point of y S C" with respect 


to (1.11 is any smooth (possibly complex) point x of Vc such that y — x lies in the 


normal space of Vc at x, meaning that y — x lies in the span of the gradients of the 
defining equations a.t x. It was shown in [T^ that for a general data point y gC^, 


the number of (complex) critical points of (1.11 is a constant. This constant is called 
the Euclidean distance degree of V, denoted by EDdegree(V), and is a measure of 
the algebraic complexity of expressing a minimizer of (1.1) as a function of y. 


The work in this paper is geared towards understanding the (real) critical points 
of orthogonally invariant matrix sets. Our theme is best illustrated with an example. 
Fix positive integers r < n < t and consider the matrix set 


:= {Ae 


rank(A) < r}. 


Finding the closest matrix of rank at most r to a given matrix Y arises in many 
applications. The set is a real variety, and the authors of 12 established that 
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EDdegree(M"^‘) = (”). They also provide a recipe for all the critical points of Y 
on which may be viewed as a generalization of the Eckart-Young theorem. 

In the context of this paper, what is important about is that it is orthogonally 
invariant, meaning that if X S then for all orthogonal matrices U and 

V of appropriate sizes, UXV^ is also in Alternately, membership of X 

in the set is determined solely by the vector of singular values cr(A) = 

(tTi(A),... ,(t„(A)). Indeed, a matrix X lies in if and only if its vector of 

singular values a{X) lies in the set 

K” := {a: e K” : rank(a;) < r}, 


where rank(a;) denotes the number of nonzero coordinates of x. Observe that the 
geometry of the piecewise linear set R" is much simpler than that of the highly 
nonlinear set In particular, M" is also a variety and EDdegree(K") = ("). 

The equality EDdegree(]R") = EDdegree(R"^‘) is not accidental; in this paper, we 
elucidate this phenomenon. 

As alluded to above, our focus in this paper is on a (naturally defined) real analog 
of critical points for (1.11. Namely, we say that x € R" is a (real) ED critical point 
of y S R" relative to a set V C R" (not necessarily a variety) if a: is a smooth point 
of V and y — x lies in the normal space to V at a;; see Definition |2.1| for details. 
Our main result shows that one can always obtain the ED critical points of an 
orthogonally invariant matrix set by restricting to diagonal matrices, or equivalently, 
to an absolutely symmetric set obtained from the singular values of the matrices. 
In the last section, we explore the connection between ED critical points and the 


critical points in the sense of 12 that happen to be real, a surprisingly subtle topic. 


In particular, ED critical point calculations often help to understand EDdegree(V), 
when V is a variety. 

Sets constrained via their singular values are numerous in applications. Define 
8 C R^^^ to be the set of rank deficient matrices with two equal singular values: 

: ai{X) = a2{X), a3(A) = 0}. 


( 1 . 2 ) 


8-.= {X& 


A matrix A £ f is called an essential matrix in 3D computer vision and represents 
a pair of calibrated pinhole cameras 17 Chapter 9]. The set 8 also happens to 


be a real variety cut out by the following ten cubic equations in the entries of 
X Proposition 4]: 


(1.3) 


det(A)=0, 2XX^X-tv{XX^)X = {). 


Observe that 8 is orthogonally invariant as membership of a matrix X in 8 depends 
only on its singular values. We will show that one can obtain the ED critical points 
of 8 from the simpler set E ^^2 of vectors in R^ with one coordinate zero and the 
other two equal in absolute value. 

The idea of studying orthogonally invariant matrix sets M. C R"^‘ via their 
diagonal restrictions S = {x : Diagx e Al} is not new, and goes back at least to von 
Neumann’s theorem on unitarily invariant matrix norms [30| . In recent years, the 
general theme has become clear: various analytic properties of M and S are in one- 
to-one correspondence. This philosophy is sometimes called the “transfer principle”; 
see for instance, [^. For example, A4 is C^-smooth around a matrix X if and only 
if S is C^-smooth around cr(A) [^ [^[^[^ . Other properties, such as convexity [^, 
positive reach j^, partial smoothness [^, and Whitney conditions 13 follow the 
same paradigm. We note in passing that the setting of eigenvalue constrained sets 
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of nonsymmetric matrices is more complicated; see e.g. [3 23 . In the current work, 
we derive a transfer theorem for the ED critical points of an orthogonally invariant 
matrix set M. and illustrate it on several examples. The manuscripts [7 22 24 play 
a central role in our work. The transfer paradigm has many useful features. The 
first is that in many instances, the calculation over the set S C M" is simpler than 
the original one over M.. This can make the formulas for the number of ED critical 
points of M. much more transparent as compared to the direct calculation in matrix 
space. The case of is an example of this. Secondly, our method focuses on 
(real) ED critical points as opposed to all complex critical points in the setting 
of algebraic varieties. This is useful in applications and allows us to gauge the 
difference between the real and complex situations (under appropriate conditions). 

This paper is structured as follows. In Section we establish basic notation and 
define the key notion of ED critical points of a set in R" with respect to a data 
point y. Then in Section]^ we derive our main theorem that transfers the study of 
ED critical points of an orthogonally invariant set of matrices to that of its diagonal 
restriction. Section illustrates the technique on some concrete examples deriving 
formulas for the number of ED critical points in each case. Finally in Section]^ we 
establish the relationships between our work and the results in [^, by restricting 
to algebraic varieties. In particular, we compare our formulas to those for EDdegree 
in several instances. We note in passing, that many of the results in the paper also 
hold for complex matrices with respect to the standard Hermitian inner product. 


2. ED CRITICAL POINTS OF SUBSETS OF K” 

Throughout this paper we consider the n-dimensional Euclidean space M", with 
a fixed orthonormal basis. Let (•,•) denote the inner product in this setting, and 
II • II denote the induced norm. The distance and the projection of a (data) point 
y G R" onto a subset S C R", are defined by 

dists(?/) := inf lly-xll, and 

xGS 

projs(y) ■■={xe S : dists(y) = \\y - a:||}. 

Computing the distance of y to 5" amounts to solving the optimization problem: 


( 2 . 1 ) 


inf 
x^s 2 


1 1 

||y - a;f = inf - '^{y^ - Xi)^. 


res 2 


i=l 


A classical first-order necessary condition for a putative point a; S S' to be optimal 


for (2.11 is that the gradient of the object function, namely x — y, makes an acute 
angle with every vector v in the tangent con|^ 

Zi- X 




\Zi - xll 

One can regard such points as generalized critical points of the distance minimization 


problem (2.1). On the other hand, in order to compare and unify our work with 
that in (H , we will impose an extra smoothness condition on a point a: G S in order 
for it to be considered critical for (O- To this end, throughout the manuscript, 
we fix p € {2,3,..., 00 , 0 ;}, and say that a point x G S \s -smooth if there is a 
neighborhood of x such that S C D is an embedded C^-smooth manifold (recall 
that C“ means real analytic). In this case, the tangent cone Ts{x) is the usual 


^If X is an isolated point of S, then Ts^x) is declared to consist only of the origin. 
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Figure 1. A parabola in the plane with its evolute or ED discriminant. 

tangent space in the sense of differentiable manifolds, and the criticality condition 
above amounts to the inclusion y — x G A/s(x), where JVs{x) denotes the normal 
space to S' at a: — the orthogonal complement of Ts{x). 

From now on we abbreviate “C^-smooth” to “smooth”. We will use S* to denote 
the set of all smooth points of S. Here is then our main definition, attuned to the 
one considered in [T^ . 

Definition 2.1 (ED critical points). Consider a set S C M" and a point y in K". 
A point a; G S is an ED critical point of y on S if the following conditions hold: 

(i) X G S*, and 

(ii) y-xG Msix). 

The symbol Cs{y) will denote the set of all ED critical points of y on S, while the 
cardinality of Cs{y) will be denoted by Cf{y). 

By definition, all ED critical points of y on S are real and smooth. The number 
of ED critical points (y) varies with y. For example, consider the parabola S 
shown in Figure with an additional curve called its ED discriminant or evolute. 
All points y above the evolute have three ED critical points while below the evolute 
they have one ED critical point. Since S is an algebraic variety, we can compute its 
EDdegree which is three, i.e.. Sc has three distinct regular complex critical points 
almost everywhere. We will comment more on the ED discriminant in the Appendix. 

Letting 0 be the collection of all Lebesgue null subsets of K", the following 
worst-case measure of criticality arises naturally: 

C*{S) := inf sup C'|(y), 

re© 

where F"^ is the set complement of F in the ambient space. Indeed, intuitively 
if one believes that any single zero measure set F can be discarded, then C'^(«S') 
measures the maximal value of C"g (y) that has a non-negligible chance of being 
encountered. For our purposes, one could think of (7^(5) as a real analog of the 
EDdegree considered in [^. We discuss this further in Section]^ 

We note in passing that this criticality measure can be infinite in pathological 
situations (e.g. union of countably many co-centric circles in M^). On the other 
hand, for structured sets, such as those that are semi-algebraic, this number is finite. 

Proposition 2.2 (ED critical points of semi-algebraic sets). For any semi-algebraic 
set S in M", the number C“^(5') is finite. 

Proof. Standard quantifier elimination shows that the set S* is semi-algebraic. 
Define the manifolds Mi := {x G S* : dim7s(a^) = i} for f = l,...,n. Again, 
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quantifier elimination shows that the normal bundle 

i 

is a semi-algebraic subset of K" x K” having dimension n. Consider now the 
mapping (/>: —>• K" defined by (j){x,v) = x + v. Notice that Cs{y) coincides with 
the projection of the preimage onto x. For dimensional reasons, there is a 

full-measure set D C M" so that for every y G D the preimage (j)~^{y) has finite 
cardinality. Moreover, since preimages of a semi-algebraic map have a uniformly 
bounded number of connected components (see e.g. Theorem 3.12]), the quantity 
Cf{y) is uniformly bounded on D. The result follows. □ 


Next we consider ED critical points on a union of finitely many sets. Consider 
a finite collection of sets {Si}i^x in M” and define the union U := 
general, the two sets Cu{y) and UiGi^'Si(y) can be vastly different because of the 
way the sets Si intersect. For instance, think of two half spaces whose union is K". 
A simple situation in which more can be said is when locally around each critical 
point X S Cu{y), the set U coincides with Si for some * S I. In that situation, if 
X € Cu{y) then x is also in Cs^iy). An important situation in this paper is the case 
of all Si being affine subspaces. We say that a finite collection of sets {Si}i(zi in K” 
is minimally defined if no Si is contained in any Sj for distinct indices i and j. 


Proposition 2.3 (ED critical points of affine complexes). Consider a finite col¬ 
lection of affine subspaces {Si\i^i in K", that is minimally defined, and let U := 
UzGl Si- Then we have 

(2.2) U* = {x G K" : there exists unique i Gl with x G Si}. 

Consequently for any y G M", we have 

(2-3) Cu{y) = IJ (projs_(y) C U*) , 


iGX 


and the equality 


C*{U) = |Z|. 


Proof. The inclusion D in (2.2) follows since the sets Si are affine. To see the reverse 


inclusion, observe that the tangent cone to U at any point x coincides with the 
union Uigi-sgS —x). For x G U*, the cone Tu{x) is itself a linear subspace, and 
hence, by the minimality of the collection {Si}i^x, there exists a unique index i gX 
satisfying x G Si. This establishes ( |2.2[ ). Equation (2.3) then follows immediately. 

To see the last claim, consider the set 

Z := {y e K" : G I with projg. (y) PU* = 0}. 

We will show that Z is a finite union of proper affine subspaces of M". To see this, 
consider a point y G Z along with an index i gX satisfying projg. (y) C\U* =0. From 
(2.2) , we conclude that there exists j G X, distinct from i, satisfying proj 5. {y)r\Sj f 0. 
Thus y lies in the set S:^- -I- {Si PSj). Since the collection {Si} is minimally defined, 
the intersection Si CiSj has dimension strictly smaller than that of Si. Consequently 
the affine space S^ + {Si n Sj) has dimension strictly smaller than n. Taking the 
union over all pairs of distinct indices i,j GX, we deduce that Z is a finite union of 
proper affine subspaces of I 


proves that C^(C/) = jlj. 


Therefore, by (2.3), Cff{y) = jlj for all y ^ Z which 


□ 
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Figure 2. the set 


The following two elementary examples illustrate Proposition |2.3[ these are 
essentially the piecewise linear examples alluded to in the introduction. In the next 
section, we will use them to obtain the ED critical points of the (nonlinear) matrix 
sets and S. In what follows we set [n] n}. 


Example 2.4 (Union of r-dimensional coordinate subspaces). For x G K", define 
the rank of x, denoted by rank(a:), to be the number of nonzero coordinates of x. 
Fix an integer r € [n] and recall the set 


(2.4) K” = {t e K” : rank(x) < r}. 

Define X to be the collection of distinct cardinality r subsets of [n]. Then \X\ = 
and 

= U span{ei}igs, 

Sex 

where e,; denotes the i’th coordinate vector in K". This representation of M' 


IS 


minimal and therefore, Proposition 2.3 implies the equality C''^(IR)!') = \I\ = ("). 


Example 2.5 (fc nonzero entries equal up to sign). Fix an integer k G [n], and 
let En,k be the set of points in R” with the property that k of their coordinates 
are equal in absolute value and the other n — k coordinates are zero. Note that 
En,k can be written as the union of linear subspaces (minimally defined). 

then implies that C'^{En,k) = As a special case, the set 

(see Figure!^, satisfies C^(E 3 _ 2 ) = 6. 


2.3 


Proposition 
F' 3 _ 2 , which is a union of six lines in 


3. ED CRITICAL POINTS OF ORTHOGONALLY INVARIANT MATRIX SETS 


In this section we describe our main result which yields an elegant technique 
for counting the ED critical points of orthogonally invariant matrix sets, based on 
the tools established in [7 22 24 . Setting the notation, let denote the set of 


real n x t matrices, where we assume without loss of generality that n < t. The 
singular value map a: —?> R" assigns to each matrix X G R"^* the vector of 

its singular values (y{X) := ((Ti(X), ... ,cr„(X)) arranged in non-increasing order. 
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The corresponding inverse map is defined as 

a-\S) := {X e : a{X) e S} for any subset S of K”. 

We will be interested in subsets of that are invariant under multiplication 
on the left and right by orthogonal matrices. In what follows, we let O® denote the 
group of real s x s orthogonal matrices. 

Definition 3.1 (Orthogonal invariance). 

A set At C is orthogonally invariant if 

M = UMV^ for all U andVG OK 


An example of an orthogonally invariant matrix set is the essential variety 

£ = {AeK3x3 . ai{X) = a2{X),a3{X) = 0} 

considered in the introduction. At first sight, £ is a complicated set; it is a highly 
nonlinear real variety cut out by the cubic polynomials in (1.3). Since orthogonally 
invariant matrix sets are precisely those matrix sets for which membership is 
determined solely by the singular values of its elements, the restriction of such sets 
to the subspace of diagonal matrices plays an important role. Our strategy will 
be to exploit this observation to calculate the ED critical points of orthogonally 
invariant matrix sets. 

A mapping tt : [n] —>■ {±1,..., ±n} is a signed permutation if the assignment 
i i-A |7r(7)| is a permutation on [n] in the usual sense. We let 11)^ denote the set of 
signed permutations on [n]. Note that any signed permutation tt G induces a 
linear map K" —)■ K” which we also denote by tt. 


Definition 3.2 (Absolute symmetry). A set S C R" is said to be absolutely 
symmetric if 

S = ttS for all tt G II)^ . 

For any set S' C M”, we call 11)^S := {ttx : TT G II)^ and x G S} the absolute 
symmetrization of S. 


For a vector x G R", let Diagx G R”^‘ denote the matrix with x in its principal 
diagonal and zeros elsewhere. If At C R"^* is orthogonally invariant, then the set 
{x G R" : Diagx € At} is absolutely symmetric, and we have the following basic 
observation (see e.g. Proposition 5.1]). 

Theorem 3.3 (Diagonal Correspondence). A set At C R”^‘ is orthogonally in¬ 
variant if and only if there exists an absolutely symmetric set S C R" such that 
At = o-i(S). 


Thus the assignment from the family of absolutely symmetric sets in R" to 
the family of orthogonally invariant sets in R"^‘ is a bijection. As discussed in the 
introduction, it so happens that various analytic properties of absolutely symmetric 
sets S and orthogonally invariant sets cr“^(«S') are in one-to-one correspondence; 
see e.g. |6}|^ [T^[^[^[^|30] . This may seem somewhat surprising since cr is a 
highly nonsmooth mapping; it is the absolute symmetry of the underlying set S that 
makes up for the fact. As an illustration, we show in Theorem |3.4| how the transfer 
principle for algebraicity can be used to see that the set of essential matrices £ is 
a variety, even without explicitly knowing its defining equations (1.3). The proof 
is entirely analogous to the symmetric case in Proposition f.fj; we provide an 
argument for completeness. 
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Recall that a set V C K” is called a real variety if there exist polynomials 
fiT ■ ■ ,fs G K[a:i, ■ • •, Xn] such that V = {a; S K" : fi{x) = ...= fsix) = 0}. Note 
that if a; G V, then X]i=i fii^) = 0; conversely if a point a; G K" satisfies 
J2i=i fii^) = 0, then fi{x) = 0 for alH = 1,..., s, and hence, x G V. Therefore, V 
can be described as the set of real zeros of a single sum of squares polynomial with 
real coefficients, which we call a defining polynomial of V. 


Theorem 3.4 (Transfer of algebraicity). Suppose S C K" is an absolutely symmetric 
set. Then a~^{S) C is a real variety if and only if S is a real variety. 


Proof. If cr“^(S') is a real variety, then 5 = {x G K” : Diagx G cr“^(S')} is the 
real variety cut out by the equations defining cr“^(5') when Xtj are set to zero for 
all i j. Suppose conversely that S' is a real variety with defining polynomial 
/ G K[xi,..., x„]. Consider the polynomial /(x) := Note that 

/(x) is also a defining polynomial of S, and hence, a~^{S) is the zero level set of 
f o a. To hnish the proof, we just need to show that / o ^{X) is a polynomial in 
the entries of X. To this end, since / is invariant under sign changes, it is easy 
to see that / is a symmetric polynomial in the squares xf,..., x^, that is we may 
write /(x) = gixf, ..., x^) for some symmetric polynomial g. By the fundamental 
theorem of symmetric polynomials (see e.g. 25 ) we may write 5 as a polynomial of 
elementary symmetric polynomials Ci,..., e„. On the other hand, the expressions 
ei{af{X),... ,a'^{X)) coincide with the coefficients of the characteristic polynomial 
of X^ X and are hence, polynomial expressions in the entries oi X. □ 


Another useful illustration of transfer principles concerns the distance to orthogo¬ 
nally invariant matrix sets. Recall that for a set S C M", the distance and projection 
of a point y G K" to (respectively, onto) S are defined by dist 5 (j/) := inf 2 ,gs Hj/ —x||, 
and proig{y) := {x G S : dists(?/) = \\y — x||}. (The distance and the projection in 
the matrix space are defined analogously with respect to the Frobenius norm 
II^11 •= ^ij-) Then for an absolutely symmetric set S' C M”, the following 

holds Proposition 8]: 

(3.1) dist,^-i(s)(y) = dists(CT(r)). 

This in turn implies the following result, which was essentially proved in Proposi¬ 
tion 8], though not formally recorded. We provide a proof sketch for completeness. 

Proposition 3.5 (Projections onto orthogonally invariant matrix sets). If S C M" 
is an absolutely symmetric set, then for any matrix Y G , the projection 

projo.-i( 5 )(y) is precisely the set 

(3.2) (t7(Diagx)P^ : U gO^,VgO^ where ^ ^ U{Dmg a{Y))V^, 1 ^ 

I xG proj5(cr(r)) J 

Proof. Consider first matrices U G 0",V G O* with Y = [/(Diag r7(Y))V^ and a 
vector X G proj 5 (tT(y)). Define X := [/(Diagx)P^ and observe the equalities: 

||X - r|| = ||x - a{Y)\\ = dists(cr(y)) = dist,^-i(s)(y), 

where the last equality follows from ( |3.1| ). Hence the inclusion X G proj^^-i (S)(T) 
is valid, as claimed. Conversely, for any matrix X G pi'oj,,.-i(g)(Y) observe 

dist<,-qs)(T) = 11^ - T|| > lk(X) - a(y)|l > dists(a(y)), 




REAL CRITICAL POINTS OF THE DISTANCE TO ORTHOGONALLY INVARIANT SETS 9 


where the first inequality follows from the Von Neumann’s trace inequality |18[ p. 
182]; see also [24[ Theorem 4.6]. Equation implies equality throughout. In 
particular we get cr(V) G proj 5 ((T(y)). Moreover, applying the equality characteri¬ 
zation in the trace inequality Theorem 4.6], we conclude that X and Y admit 
a simultaneous ordered singular value decomposition, that is, there exist matrices 
U G 0^,V G O* with Y = 17(Diag a(Y))V^ and X = 17(Diag a{X))V^. This 
completes the proof. □ 

A consequence of Proposition |3.5| is a convenient representation of the normal 
space at a point in an orthogonally invariant manifold. First of, the set is 

smooth around X if and only if S is smooth around (j{X) (see e.g. Theorem 2.4]). 
Next recall that at any C^-smooth point a: of a set M, the following equivalence 
holds: 


zgNm{x) {x} = projM(T-I-Az) 


for some A > 0. 


Proposition |3.5| and a short computation implies that at any smooth point X of 
a~^{S) the following formula holds: 


AC-i(s)(A) = |l7(Diagz)VT : f/G C>”, V e C>‘with 


X = C/(Diagcr(X))VT^ 

z G A/]s(cr(X)) 


A more general expression without smoothness assumptions can be found in 24 


Theorem 7.1]. Using the above facts, we now derive a transfer principle for ED 
critical points which is the main result of this section. The proof uses the following 
lemma. 


Lemma 3.6 (Transfer of Lebesgue null sets). Consider an absolutely symmetric 
set S C K". Then S is Lebesgue null if and only if a~^{S) is Lebesgue null. 

We have placed the proof of the lemma above at the end of the section so as to 
not stray from the narrative. 

Theorem 3.7. (ED critical points of orthogonally invariant matrix sets) 

Consider an absolutely symmetric set S C M" and a matrix Y G along with a 

singular value decomposition Y = U{D\&g(jlY))V . Suppose moreover, that Y has 
all distinct singular values. Then for the orthogonally invariant set A4 := (t“^(S'), 
we have 

(3.3) C>i(y) = {I7(Diaga;)F^ : w G Cs(a(y))}. 

Consequently, we also obtain the equality = C"^{S). 

Proof. Consider a critical point X G Cm{Y)- Then there exist orthogonal matrices 
U and V, and a vector z G J\fs{cr{X)) satisfying 

X = U(Diagcr(X))U^ and Y = X + U {Biag z)V^. 

Hence we deduce 

Y = t7(Diag (cr(X) -f z))V^. 

By uniqueness of singular values and singular vectors, there exists a signed permu¬ 
tation matrix tt : M" —>■ K" satisfying 

cr{Y) = 7r((T(X)) -f 7r(z). 

On the other hand, it is easy to verify the inclusion 7r(z) G Afs{,T^o'{X)). Therefore, 
we obtain 7r(cr(X)) G CsicrlY)). 
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Observe 


Define now the extended signed permutation matrix tt := 
then the equality 

V = C/(Diag7r-i(CT(y)))yT ^ (ljTr)(Diaga(V))(V7r)^. 

Changing the left and the right singular vector pairs in (7 and in V by the same sign 
change in the first place, if needed, we can ensure C/tt = U and = V. Hence, 


X = U(Diaga(X))V^ = U(Diag7r(a(X)))V 


as claimed. 

Now to prove the reverse inclusion, consider a point uj G Cs((T(y)). Thus there 
exists a vector z G A/s(a;) satisfying 

cr(Y) — u) = z. 

Choose a signed permutation tt so that 7r(a;) is nonnegative and nonincreasing. 
Observe then that 

Tr{a{Y)) — = 7r(z) G 

Defining 

X := U{T)iagoj)V^ = (C/7r)(Diag7r(w))(l/7r)^, 

we deduce 


X + ([/7r)(Diag7r(z))(V’7t)^ = ([/7r)(Diag7r(cr(y)))(V7r)''^ = Y. 


The inclusion X G Cs(F) follows. This establishes equation (3.3). 
Define now the set 


A := {a; G K" : \xi\ = \xj\ for some indices i ^ j}. 


Consider a Lebesgue null set T C M". Then clearl y th e absolutely symmetric set 
r := A U n^T is Lebesgue null as well. Lemma 3.6 shows that cr“^(r) is also 
Lebesgue null. Note for all tt G H^ and y G K", one has Cs(7ry) = nCsiy) 
hence Cf{Try) = Cf{y). Combining these observations, we obtain 


C*{M) < sup Cf^{Y) = sup Cf (y) < sup Of (y), 
YG<T-i(fo vef‘= 


'<#/ 




where the middle equality follows from equation (3.3). Since T was an arbitrary null 
set, we deduce < C^(5'). 

To see the reverse inequality, consider a Lebesgue null set T C Consider 

the following subset of T: 

f := {F G r : UYV^ G T for all U gO^,V G O*}. 

Clearly T is an orthogonally invariant Lebesgue null set. Now trivially we have 


sup C^{Y) > sup Cf^{Y). 






On the other hand, it is easy to verify the equation CMiUW^) = U{CM{y))y^ 
and thus, C^iUYV^) = C^{Y) for all Y G and U gO^,V G O*. Hence 

sup C^{y) = sup Cf^{Y). 
yer- 'i'erc 
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Notice r := r U (T ^(A) is also a Lebesgue null orthogonally invariant set. Hence 
we deduce 


sup C*^{Y) > sup C# (F) = sup Ct{y) > C#(5), 
ygT'^ y6CT(r)‘= 


where the middle equality follows from equation (3.3) and last inequality follows 
from Lemma [ 3 .6[ Since L was an arbitrary Lebesgue null set, we deduce the reverse 
inequality C^{M) > C'^(5'), and hence equality, as claimed. □ 


The assumption that singular values of the data point Y are distinct is essential 
in Theorem |3.7| as the following example shows. 

Example 3.8 (Nondistinct singular values). Consider the orthogonally invariant 
set At = {X S : det(A) = 0}. We may write At = cr“^(5') where 

S := {{xi,X 2 ) € : X 1 X 2 = 0} is absolutely symmetric. Then Cs((l,l)) = 

{(1,0), (0,1)}. On the other hand, one can verify that uvJ G CMih) for each 
M G of norm one, where I 2 is the 2x2 identity matrix. 


We now apply Theorem |3.7| to our two running examples. 


Example 3.9 (Matrices of rank at most r). Fix r <n and recall the orthogonally 
invariant set 

= {X e K”''* : rank(A) < r}. 

This set is a determinantal variety cut out by the (r + 1) x (r + 1) minors of a 
symbolic n x t matrix. One sees that where M" is the absolutely 


symmetric set defined in (^2.4|. Theorem 3.7 and Example 2.4 together imply that 




This number is also the ED degree of (cf. 12 Example 2.3]); we will revisit 
this point in Section]^ In particular, the ED critical points of a general data matrix 
Y, with singular value decomposition Y = C/(Diag a(Y))V^, are the (") matrices 
obtained by setting to zero all possible choices of n — r singular values in this singular 
value decomposition. The matrix 

U (Diag(ai(y),...,a,(y),0,...,0)) 


is a nearest element of to Y. This is precisely the statement of the classical 
Eckart-Young theorem. 


Example 3.10 (Essential variety). The essential variety £ defined in (1.2) is 
orthogonally invariant with £ = cr~^ (£" 3 ^ 2 ) where E ^^2 is the absolutely symmetric 
set defined in Example |2.5| Then by Theorem |3.7| and Example |2.5[ 

C*{£) = C#(E3,2) = 6. 


3x3 


Furthermore, for a matrix Y G 
Y = 17(Diagcr(F))E^, by (3.2), the matrix 


along with a singular value decomposition 


U Diag 


(Ti(F) + (T2(F) cri(F) + cr2(F) 


>0 


is a nearest element of £ to Y. This is precisely Hartley’s result [16[ Theorem 5] 
which is well known in the computer vision community. The six ED critical points 
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of a general Y = t/(Diag <7{Y))V^ are the matrices 17(Diag x)V^ where x varies 
over the following vectors: 


<71 (y)+<72(Y) (Ti{Y) + (T2(Y) 

2 ’ 2 ’ 
^ ^l(y)+^3(y) ^ ai{Y) + a3(Y) ' 

2 ’ 2 
r Q o-2(y)+o-3(^) <72(y)+o-3(y) ^ 

i 2 ’ 2 


<7i(Y) —0'2(Y) 

2 ’ 


-<7i(Y) + <72(Y) pi I 

2 ’ ^ J 


<7l(Y)-cr3(y) PI -gi(Y)+g3(Y) 

2 ’ 2 

f n <72(y)-0'3(y) -<72(l^j+<73(y) ' 

i 2 ’ 2 


We end the section with the proof of the lemma evoked in the proof of Theorem |3.7[ 


Proof of Lemma 3.6 Before we get to the main part of the proof, we need some 
notation. To this end, define (2^ to be the set of t x n matrices with orthonormal 
columns. Note that (2^ is a smooth manifold of dimension nt — ^nd is 

called a Stiefel manifold] see for example Proposition A.4]. In particular (2" is a 
manifold of dimension . 'We deduce that the product manifold (2" x (2^ x K" 

has dimension exactly nt, same as For any vector x S M" let sDiag x 

denote the n x n diagonal matrix with x on the diagonal. Define now the mapping 
T:0^ xO^xR^ -)■ by setting r{U,V,x) = 17(sDiag x)V^. Notice that for 
the absolutely symmetric set S, we have equality r((2" x (2^ x S') = cr“^(S). 

Suppose now that S is a Lebesgue null set. Then clearly the set (2” x (2^ x S 
is Lebesgue null in the manifold (2” x (2^ x K”. Since T is smooth, the image 
r(C)” X 0 !f X S), which coincides with (t“^(S), is Lebesgue null as claimed. 

Conversely, suppose that cr“^(S) is Lebesgue null. It is well-known that T is a local 
diffeomorphism on an open full-measure subset of the domain manifold (2" x (2 ‘xM". 
To see this quickly, define the set 


A = {x G K" : \xi\ = \xj \ for some indices i ^ j}. 

Observe that since T is a semi-algebraic map, we may stratify O” x (2^ x 
into finitely many smooth manifolds A4i so that the restriction of T to each ^A^ 
has constant rank. Suppose now for the sake of contradiction that the derivative 
of T has deficient rank on some maximal dimensional manifold M.i. Then by 
the constant rank theorem [20[ Theorem 5.13], there exists a nontrivial smooth 
path {U{t), V{t),x{t)) in A4i so that T is constant on the path, meaning that the 
matrices U{t){sDiag x{t))V{t)^ are equal for all t. On the other hand, taking 
into account that the coordinates of x{t) are distinct and appealing to uniqueness 
of singular values and singular vectors, we obtain a contradiction. Hence T is a 
local diffeomorphism on an open full measure subset of ( 2 " x ( 2 ^ x K". It follows 
immediately that S is Lebesgue null, since otherwise the image r(( 2 ” x ( 2 ^ x S') 
would fail to be Lebesgue null. This completes the proof. □ 


4. Applications 

We now apply the techniques developed in the last section to calculate and count 
the ED critical points of several orthogonally invariant matrix sets. 

Example 4.1 (Matrices orthogonally equivalent to a given matrix). Fix a matrix 
A € We say X G is orthogonally equivalent to A if A = UAV^ for some 

U G (2" and V G O*. Let A4a Q be the set of matrices that are orthogonally 
equivalent to A. Then A4a is orthogonally invariant and A4a = cr~^(cr(A)). By 
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Theorem 13.71 


C*{MA) = \niW{A)}\=T 
where a{A) = (cti, ..., tJi,..., (7^,..., cTfc). 


ni!... rzfc! 


ni rik 

Example 4.2 (Real orthogonal group). The real orthogonal group O" is a special 
case of the previous example because O" is the set of n x n real matrices that are 
orthogonally equivalent to the n x n identity matrix, namely, 

0" = a-i((l,...,l)). 


11 


Hence C'^{0'^) = 2" which is also the ED degree of O" (cf. 

By Proposition 3.5 for any Y G with singular value decomposition Y = 


Theorem 3.2]). 


U{Q\a,ga{Y))V' where U,V G O", one has UV' G projo„(y). Moreover, if the 


singular values of Y are distinct, then by Theorem 3.7, the set of critical points 
of Y on O" is {{7(Diaga;)H^ : Xi = ±1 V i}. These results were also obtained by 
Draisma and Baaijens [11| using algebraic methods. 

Example 4.3 (Unit sphere of Schatten d-norm). The Schatten d-norm of a matrix 
X G is defined as 

ijd 


1X|L:= 






Note the following fact: ||t7XU^||d = ||X||d for any X G U G V G O*. 

Hence the unit sphere of the Schatten d-norm 

e : ||X|U = 1} 

is orthogonally invariant. Assume d > 2 is an even integer. One has !Fn,t,d = 
a~^{Fn^d) where F^^d is the affine Fermat hypersurface 


Fn.d ■■= {x G 


-- 




which is absolutely symmetric. Therefore, C'^{Fn,t,d) = C'^{Fn^d) for all t. 

Consider the special case n = 2. For simplicity, we set F^ := F 2 ^d- Then F 2 is 
the unit circle and C'^{F 2 ) = 2. Suppose d > 4. For any 1 / G K^, a point x is an ED 
critical point of y on F^ if and only if x lies on both F^ and the curve 

(4.1) 7 d := {a: G : x‘[~'^{x 2 - y 2 ) = X 2 ~^{xi - yi)}. 

The graph of 74 is shown in Figure]^ By symmetry, we may assume that yi > y 2 > 0. 
When yi is small, the “optimal” points, 7711 , 7712 , 7713 , 7714 , on the pieces of jd in the 
coordinate directions lie inside the curve Fd, namely, 

77ii G {x G : Xi + X 2 < 1} 

for 1 = 1,..., 4. Hence, Cp^ (y) < 8 for any point y except on a null set, and there is 
an open set V in such that Cp^{y) = 8 for any y GV. To sum up, by Theorem 


3.71 we have 


C*{J^2,t,d) = C*{Fd) = 


if d = 2 
if d > 4. 
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Figure 3. The curves and 7^ in Example |4.3| for d = 4 


Example 4.4 {SL^). This example was considered in 
orthogonally invariant set 


11 


Section 4], Define the 


:= {X G : det(X) = ±1} = 

where is the absolutely symmetric set defined as 


Hn := {a: G K” : Xi • • • = ±1}. 

Consider the special case n = 2. We evaluate C'^(S'LJ) by computing C'^(id 2 )- 
The set H 2 is a disjoint union of two hyperbolas: 

■= {x G : X1X2 = ± 1 }- 


Since these hyperbolas are disjoint, for any y, one has 

ctSy) = ctAy) + c*_{v). 

Notice that each of the sets C^± {y) is defined by the roots of a univariate quartic. 
Indeed, 

(y) ^ ^ ~ + 2;y2 - 1 = o| 

and 

(y) = I (^2:, G : qy{x) := - x^yi - X2/2 - 1 = o| . 


By computing the trace forms of these quartics and applying Sylvester’s criterion 
(see e.g. 28 Corollary 2.9]) we acquire these results: 


ctiy) = 


if D~^{y) > 0 OT D (y) > 0 
if D~^{y) < 0 and D~{y) < 0 


where the bivariate polynomials 

D+{y) := -256 + 192yij/2 + ^vlvl + ^vlvl - 27yt - 27yj 
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yi 


Figure 4. Values of for general y, in Example 


4.4 


and 


D- 


(y) := -256 - I92y,y2 + Qylyl - 4.ylyl - 27yt - 27yt 
are the discriminants of and g” respectively; see Figure Then by Theorem 
3.7 we know 

C*iSLt)=6. 


5. Connections to the Euclidean distance degree 


As mentioned in the introduction, this work was inspired by the general framework 


12 for counting the critical points of the squared Euclidean distance function on 


an algebraic variety. In this section we comment on the relationships between our 
results and those in 12 , beginning with some basic facts about real varieties [T|[M]. 

Consider a real variety V C K" whose vanishing ideal is generated by the 
polynomials /i,..., /s € ..., Xn]- Then the Zariski closure of V is Vc := {x G 

C" : fi(x) = ... = fs{x) = 0}, the smallest complex variety containing V. The 
real part of Vc, i-C., Vc n K”, is precisely V and hence, the vanishing ideal of Vc in 
C[a;i,..., Xn] is also generated by /i,..., /g. Also, both V and Vc have the same 
dimension over M and C respectively. Recall that if Vc = Uj=i ^ minimal 

irreducible decomposition of Vc, then for any j, the real part of W-^ , denoted by U\ 
is an irreducible real variety whose Zariski closure is WA Moreover, V = i® 

a minimal irreducible decomposition of V. For simplicity, in the rest of this section 
we will assume that Vc is irreducible. 

Denote by J(/) the s x n Jacobian matrix whose (j,j)-entry is Then a 

point a: G Vc is said to be regular if the rank of the Jacobian matrix evaluated at t, 
rank( J(/)(a:)), equals the codimension of Vc- Let V™® denote the regular points 
of Vc. It is known that V^®® is a Zariski open subset of Vc, and its complement 
is a proper subvariety in Vc, denoted as Sing(Vc), and called the singular locus of 
Vc. The set V’’®® of regular points in V is precisely the set of real points in Vc**®. 
However, while V™® is dense in Vc in the Euclidean topology, V*^®® may not be dense 
in V or even V*, the set of smooth (i.e., C^-smooth) points in V. The following 
example illustrates this behavior. 
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Figure 5. The Cartan umbrella 


Example 5.1 (Cartan umbrella). The Cartan umbrella is the real variety 

V = {{xi,X 2 ,X 3 ) G : X 3 {xl + xl) - xf = 0}. 

Here Vc is irreducible, as is V. On the other hand, V is also the union of a surface 
(which is not a real variety) and the xa-axis (Figure]^. No point on the Sa-axis is 
regular while all points on the Xa-axis, except the origin, are smooth. 

This example shows an important distinction between smooth and regular points 
on a real variety. Our goal in this section is to elaborate on this difference in the 
context of critical points as defined in 12 . Recall that the normal space at a regular 
point X G Vc, denoted as Afy^^(x), is the row space of J{f){x). Given a data point 
y G C", and a real variety V, the authors of 12 study the set of critical points 


(5.1) 




Note that in the above algebraic setting, even though our input is a real variety 
V C R", there is a natural passage into the complex numbers; the real variety V 
is replaced by the complex variety Vc, the data point can be any y G C”, and the 
critical points of y are regular points in Vc- This complexification yields the first key 


result in 12 , namely that for a fixed V, the number of critical points of a general 


data point y G C" is a constant. In other words, the cardinality of C^^{y), which 
we will denote as C'y^‘^®®(y), is a constant. This constant is called the Euclidean 
distance degree (EDdegree) of V in 12 . The phrase “general data point” refers 


to the fact that C^J^^{y) is different from EDdegree(V) only if y lies on certain 
proper subvarieties in C" which we will describe in the Appendix. We remark that 
if Vc is reducible, then EDdegree(V) is the sum of the ED degrees of its irreducible 
components. 

Our results in this paper provide a method to compute the (smooth) ED critical 
points of a real orthogonally invariant set of matrices. We did not require this set 
to be a real variety, and the notion of smoothness (of critical points) was geometric 
and not algebraic. Therefore, in order to connect our work to that in 12 , we 
need to restrict to real varieties V C R", data points y G R", and understand the 
relationships between the following two sets of critical points of y: 

• Cy {y ), the set of ED critical points of y on V as in Section and 

• C'v *(?/), the set of regular critical points of y on V. 
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As usual we denote the cardinalities of these sets by Cy (y) and Cy respectively. 
Also, note that Cy ^(y) = C^^{y) n K" = Cy^{y) n V. Since a regular point of V is 
C'f'-smooth for p G {c»,a;}, for any y we have, 

(5.2) C Cv( 2 /) and so, C* (y) < C*{y). 

However, for a given y, this inequality can be strict since there can be smooth points 
on V that are not regular as we saw in the Cartan umbrella. In fact, in the Cartan 


umbrella there is a dense set of y for which the inequality in (5.2) is strict since for 
any y with 1/3 ^ 0, there will be an ED critical point on the xa-axis. 

Moving on to invariants of the entire set V, on the one hand, we have the quantity 
C'^(V) = infree sup^gy^ C'y (y) from Sectionj^ and on the other hand, we have the 
constant EDdegree(V) which is an upper bound to whenever Cy®®(y) is 

finite. Since the inequality in ( |5.2[ ) can be strict, it is not clear that the number 
(7^(12) is always a lower bound to EDdegree(V). However, it is true under certain 
conditions. 

Theorem 5.2. IfV is a real variety such that is Euclidean dense in V*, then 
we have the inequality C"^(V) < EDdegree(V). 

This is an immediate consequence of the following lemma. Note that the Cartan 
umbrella does not satisfy the assumptions of Theorem |5.2[ 


Lemma 5.3. Consider a real variety V C M” (possibly reducible) such that is 
Euclidean dense in V*. Then there is an open set of points y G K" such that 

(5.3) for any irreducible component W ofV, = EDdegree(yV), 

(5.4) C*{y)=C*{y), and 

(5.5) Cv(y) C C™®(y). 


Proof. Fix an irreducible component W of V. Then from 12 Theorem 4.1] the set 
of all y G C" for which ^ EDdegree(yV’) lies in a proper subvariety D in 

C". Consider the real variety D® := D n K" which contains all the real points y 
with Ci^(f^{y) EDdegree(yV). The Zariski closure of D® is a subvariety of D with 
complex dimension equal to the real dimension of D®. Therefore, the real dimension 
of Dr is smaller than n, and hence Dr is a proper subvariety in K”. In particular, 
the complement of Dr in K" is an open full measure set. Taking the intersection of 
all such open full measure sets as we vary over the irreducible components of V, we 


still have an open full measure set, and any point y in this set satisfies (5.3). 
Since V is a variety, one has C^(V) < 00 by Proposition 


2.2 


Then by quantifier 

elimination, the set {y G K" : Cy{y) = C'^(V)} is semialgebraic. Moreover, this 
set cannot be Lebesgue null by the definition of Therefore, it must contain 


an open subset. Hence (5.4) holds for all y in some open set. 


We now show that (5.5) holds for any point y in an open full measure set, which 


will prove the lemma. For any z = 0,1,..., n consider the submanifold 
Mi := {x gV* : dim(7v(a;)) = i}. 

These submanifolds Mi clearly partition V*. We first claim Mi n c\{Mj) = 0 for all 
distinct pairs z,j G [n]. Note the symbol “cl” here denotes the Euclidean closure. 
Indeed, if there existed some point x G Mi n cl(Mj), then we would deduce for all 
points y G Mj sufficiently near x, the equality j = dim7v(y) = dim7v(ai) = z, a 
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contradiction. The middle equality follows from the fact that dim7v(') is constant 
in a neighborhood of a: in V. 

Now since V™® is dense in V*, the inclusion V* C cl(V''®®) = UiI"' ^i) 
holds. We claim that for any i, the set W®® n Mi is Euclidean dense in Mi, namely, 
Mi C c^V^^nMi). To see this, it suffices to establish Mincl(V"'®®niWj) = 0 for any 
j ^ i. This follows immediately by observing Mi n cl(V"'®® n Mj) C Mi n c\{Mj) = 0. 
It follows that for any i, the strict inequality 


dim(Mi \ (W®® n Mi)) < dimM, 

holds (see Proposition 3.16]). By dimension arguments, the dimension of the set 

IJ (x+Afvix)) 

is strictly less than n. In particular, its interior is empty. Taking the union over i, 


IJ (a:+A/'v(a;)) 

a:GV\V™E 


also has empty interior. Any y which is not in this set satisfies (5.5). 


□ 


We conclude the paper by comparing C'^(V) and EDdegree(V) in some of the 
examples we saw in Section]^ and Section]^ 

Example 5.4 (SL^). Every point in the Zariski closure of SL^ (c.f. Example |4.4[ ) 
is regular. Therefore by Theorem |5.2[ we obtain 

C*{SL^) < EDdegree(5'L±) = n2". 


The formula for EDdegree was derived in [Tl] 
inequality holds when n = 2. 


We saw in Example 4.4 that strict 


Example 5.5 (Unit sphere of Schatten d-norm). Consider the set 


2,2,d C 


p2x2 


given in Example 4.3 Eor any d the Zariski closure of JS, 2 ,d is regular everywhere. 
By Theorem |5.2[ 

EDdegree(J'2,2.d) > C*{J^2,2.d) = 8 , 


for d > 4. From Macaulay2 15 computations, EDdegree(J^ 2 , 2 ,d) equals 16, 34, 64,98 


when d = 4,6,8,10 respectively. This suggests the gap between C'^(J^ 2 , 2 ,d) and 
EDdegree(J^ 2 , 2 ,d) increases with d. 

Example 5.6. Let £ be the essential variety from It is an irreducible variety 

of codimension three in 10 Proposition 3.6]. By the transfer of smoothness 

(see the paragraph after the proof of Proposition 3.5), we know £* = £ \ {0}. 
Moreover, a straightforward computation verifies that any non-zero essential matrix 
is a regular point of £. Thus f®® = £* 


(5.6) 


= £ \ {0}, and by Theorem 5.2 we have 
EDdegree(£) > C'^{£) = 6. 


With completely different tools, but entirely motivated by this work, one can prove 
the equality EDdegree(£) = 6. The details will be described in a forthcoming paper. 
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Appendix A. 

The Cartan umbrella illustrates a yet unexplored feature of the EDdegree con¬ 
cerning the exceptional loci of points y at which can be different from 

EDdegree(V). We comment briefly on this topic which deserves further investigation. 

For each y G C", the set Cy ®(y) is the variety of a polynomial ideal called the 
critical ideal of y (see (2.1) in [^), and typically, this variety has EDdegree(V)-many 
distinct complex solutions. There are two ways in which the critical ideal of y can 
have fewer distinct roots; the first is because of roots with multiplicity, and the 
second is because a root may wander off into Sing(Vc) due to closure issues. These 
situations create two exceptional loci in the space of data points y. The locus of 
y G C" for which the critical ideal has roots with multiplicity is called the ED 
discriminant of Vc, and is denoted as Svc- The ED discriminant is typically a 
hypersurface in C" and can be computed from the equations of V (see Section 7 
in for algorithms and examples). We saw the ED discriminant of the parabola 
{{xi,X2) G : X2 = x\} in Figurej^ It is the curve defined by 

16y| - 21yl - 24y| + 12^2 -2 = 0. 


The second type of exceptional locus has not been studied in and was 
suggested to us by Bernd Sturmfels. To describe it, let us denote the copy of C” 
that contains Vc as C” and the copy that contains the data points y as Cy. The 


ED correspondence of Vc, denoted as Evj. and described in 12, Section 4], is the 
Zariski closure of: 


(A.l) {ix,y) e C: X C-y : X G Vp, y G C\ x - y G W^^(t)} 


By 12 Theorem 4.1], the ED correspondence is an irreducible variety of dimension n 


in C^xCy. It admits two natural projections and ny into C” and Cy respectively. 
Over general y G C”, the projection tt^ : Ey^. —Cy has finite fibers of cardinality 
equal to EDdegree(V). The exceptional locus we are looking for is the set of data 
points y G C" that have critical points that fall into the singular locus Sing(Vc). 
This is precisely the Zariski closure of the set 


(A.2) TTy (Ey, n (Sing(Vc) X C^)) 

which we call the ED data singular locus of Vc, and denote by D^c- This affine 
scheme deserves further study, and we illustrate both Dy^. and Evc for the Cartan 
umbrella in Example |A.1[ Note that Dyj, is empty for the parabola in Figure 
since its singular locus is empty. The upshot of the above discussion is that for all 
y G K” \ Dvc but in a connected component of Kj) \ Svc , the number of regular 
critical points on V, Cy’^®®(y), is a constant. 

Example A.l (Cartan umbrella continued). The ED discriminant of the Cartan 
umbrella in Example |5.1| is a degree 12 surface in the space of data points given by 
the following equation: 
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Figure 6 . The Cartan umbrella with its ED discriminant on the 
left and ED data singular locus on the right. 


256yP - 35328yl°yi - 108984yfy| - llimiytyl - 939751/^2/1 - 921Qylyl° 

-20482/2^2 - 23042/P2/3 - 21122/?2/i2/3 - 1492802/12/12/3 - 1168682/f2/f2/3 

+535322/?2/l2/3 + 345602/i2/2^°2/3 + 69122/i°2/l + 140 162/f 2/12/1 - 287642/f2/l2/i 
+415022/t2/fyi - 86 4 302/?2/l2/l - 7682/^2/1 - 79 362/?2/3 + 150720^12/1^1 
-200 1482/?yl2/3 - 4117282/?2/2 2/3 + 14762/12/12/1 + 9216y?2/3 - 46656y?2/i2/3 
+3 1 9082/fyl2/t + 1108172/?2/2®2/l + 49 532/12/1 - 27 6 482/12/1 + 238082/?2/i2/3 
+912362/?2/l2/3 - 402842/12/!2/1 + ‘28&72ytyt - 19&m2yiylyt - 240480//?2/1^1 
-2592yf 2/3 - 92162/?2/3 + 142082/?2/fy3 + 288002/i2/ly| + 27648//?y| 
+391682/?2/i2/f + 23 042/?2/f - 27648//?//! - 27 6 482/i2/?2/f = 0. 

The EDdegree of the Cartan umbrella is seven. The ED discriminant partitions 
into two regions where the real regular critical points are either one or three, 
while the number of ED critical points is two or four. In particular, for all y 
for which the polynomial defining the ED discriminant is positive, we get three real 
regular critical points. 

The ED data singular locus, Dvc, is the reducible surface in defined by 

iVi + 2/1) ■ (42/1 + 82/?2/I + 42/1 + + 36yiy^y3 + 27ylyl) = 0- 

(see Figure The data point y = (—2,—1,2) lies on Dvc but not on Evc- The 
critical ideal of this point has seven distinct complex roots but one of them is 
singular. Among the remaining six roots, two are real and, (—2, — 1 , 2) lies in the 
region where the polynomial defining Evc is positive. 

Acknowledgments. We thank Jan Draisma, Giorgio Ottaviani, and Bernd 
Sturmfels for helpful discussions. In particular, the notion of ED data singular locus 
was suggested to us by Sturmfels. 
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